Data Transformation using dbt Cloud

dbt Cloud is a hosted, enterprise-ready platform that simplifies managing and running transformation pipelines and provides visibility and governance around them. Integrating dbt Cloud with source control repositories simplifies version control of dbt code. This makes collaboration within the development team easier, while CI/CD automation lets you test the dbt code before pushing it to production.

Calibo Accelerate supports data transformation using dbt Cloud. You can add dbt as a standalone node in the data transformation stage, or you can connect it to a data lake and create a pipeline. In standalone mode, you can import dbt projects into the platform and either run existing jobs or create and run new jobs. If you connect dbt to a data lake and create a pipeline, you can create a new job and run it.

Before you create a dbt transformation job, ensure that the dbt Cloud connection details are added to the Calibo Accelerate platform. See Configure dbt Cloud Connection Details.

Prerequisites

To create or run a data transformation job using an existing dbt Cloud connection, you must complete the following prerequisites:

  • Get access to a dbt Cloud configuration listed under Configuration > Cloud Platform Tools & Technologies > Data Integration and Data Transformation.

  • Identify the combination of source and destination that you want to use for the data transformation job using dbt Cloud.

To create a dbt transformation job using a standalone node

  1. Sign in to the Calibo Accelerate platform and navigate to Products.

  2. Select a product and feature. Click the Develop stage of the feature and navigate to Data Pipeline Studio.

  3. Add a data transformation stage and add a dbt Cloud node to the stage.

  4. Click the dbt Cloud node. You can either run an existing job or create a new one.

  5. To run an existing job, perform step 6. To create a new job, perform step 7.

  6. To run an existing job, do the following:

  7. To create a new job, do the following:

  8. To run the dbt job, do one of the following:

    • Publish the pipeline, and then click Run Pipeline on the home page of Data Pipeline Studio (DPS).

    • Publish the pipeline, and then click the dbt node and click Start in the side drawer.
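For readers curious about what starting a job amounts to on the dbt Cloud side, the following is a minimal sketch of a request to the job-run endpoint of the dbt Cloud REST API (v2). It is not a description of how Calibo Accelerate triggers jobs internally. The function name `build_run_request`, the default host, and the placeholder IDs and token are assumptions for illustration; the host in particular can differ by dbt Cloud region or plan. The sketch only builds the request; sending it is left commented out.

```python
import json
import urllib.request


def build_run_request(account_id, job_id, api_token, cause,
                      host="cloud.getdbt.com"):
    """Build (but do not send) a request that triggers a dbt Cloud job run.

    The host default is an assumption; use the access URL of your account.
    """
    url = f"https://{host}/api/v2/accounts/{account_id}/jobs/{job_id}/run/"
    # The run endpoint expects a JSON body with a "cause" describing the trigger.
    body = json.dumps({"cause": cause}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Token {api_token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder values: replace with a real account ID, job ID, and token.
    req = build_run_request(12345, 67890, "<your-api-token>",
                            "Triggered from a script")
    print(req.get_method(), req.full_url)
    # To actually start the run, uncomment the following lines:
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
```

Within Calibo Accelerate, the two options listed in step 8 remain the supported way to run the job; the sketch is only meant to show the kind of call a job run corresponds to in dbt Cloud.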

To create a pipeline using a dbt transformation node

  1. Sign in to the Calibo Accelerate platform and navigate to Products.

  2. Select a product and feature. Click the Develop stage of the feature and navigate to Data Pipeline Studio.

  3. Add a data transformation stage and add a dbt Cloud node to the stage. Add a data lake stage and add a Snowflake node to the stage. Connect the nodes to and from each other, as in a typical data transformation pipeline.

  4. Click the dbt Cloud node and click Create a New Job.

  5. Provide the following information:

  6. Click Complete.

    The dbt Cloud node is successfully configured, and the supported target node is automatically added to the pipeline as an implicit node. See Implicit Nodes.

    Note:

    You cannot create manual connections to/from an implicit node, and you cannot delete the implicit node or the stage that contains the implicit node.

  7. To view the implicit connections, do the following:

    1. On the dbt Cloud node, click the expand icon to show the implicit node. The target node is automatically displayed with dotted-line connectors.

    2. In the pipeline, click the implicit target node. The configuration opens in read-only mode and cannot be edited.

  8. To run the dbt job, do one of the following:

Implicit Nodes

The supported target node that is automatically added to a dbt Cloud transformation node is called an implicit node. The connections to the implicit node (implicit target node to dbt Cloud, then dbt Cloud to implicit target node) are displayed as dotted lines.
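The rules for implicit nodes can be pictured with a small model. This is only an illustrative sketch, not Calibo Accelerate's implementation: the names `Node`, `Pipeline`, `configure_dbt`, `connect`, and `delete` are hypothetical. It shows the implicit target being added with both dotted connections when the dbt Cloud node is configured, and manual connections to or deletion of the implicit node being rejected.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Node:
    name: str
    implicit: bool = False  # True for the auto-added target node


@dataclass
class Pipeline:
    nodes: dict = field(default_factory=dict)
    # Each edge is (source, target, is_implicit); implicit edges are
    # the ones drawn as dotted lines.
    edges: set = field(default_factory=set)

    def add_node(self, name):
        self.nodes[name] = Node(name)

    def configure_dbt(self, dbt_name, target_name):
        """Add the implicit target and its two dotted connections."""
        self.nodes[target_name] = Node(target_name, implicit=True)
        self.edges.add((target_name, dbt_name, True))
        self.edges.add((dbt_name, target_name, True))

    def connect(self, source, target):
        """Create a manual connection; implicit nodes are off limits."""
        if self.nodes[source].implicit or self.nodes[target].implicit:
            raise ValueError("cannot manually connect to/from an implicit node")
        self.edges.add((source, target, False))

    def delete(self, name):
        """Delete a node and its edges; implicit nodes cannot be deleted."""
        if self.nodes[name].implicit:
            raise ValueError("cannot delete an implicit node")
        del self.nodes[name]
        self.edges = {e for e in self.edges if name not in e[:2]}


if __name__ == "__main__":
    pipeline = Pipeline()
    pipeline.add_node("dbt_cloud")
    pipeline.configure_dbt("dbt_cloud", "snowflake_target")
    print(sorted(pipeline.edges))
```

In this model, the read-only behavior of the implicit node corresponds to the two `ValueError` branches: the node exists only because the dbt Cloud node was configured, so it is never edited directly.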

Explicit Nodes

The dbt Cloud node in a data transformation pipeline is an explicit node. Unlike other data transformation nodes, this node cannot be connected to any data lake other than its implicit data lake node.

What's next? Snowflake Custom Transformation Job